Exact-Regenerating Codes between MBR and MSR 
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Abstract — In this paper we study distributed storage systems 
with exact repair. We give a construction for regenerating codes 
between the minimum storage regenerating (MSR) and the 
minimum bandwidth regenerating (MBR) points and show that 
in the case that the parameters n, k, and d are close to each 
other our constructions are close to optimal when comparing to 
the known capacity when only functional repair is required. We 
do this by showing that when the distances of the parameters n, 
k, and d are fixed but the actual values approach to infinity, the 
fraction of the performance of our codes with exact repair and 
the known capacity of codes with functional repair approaches 
to one. 

I. Introduction 

A. Regenerating Codes 

In a distributed storage system a file is dispersed across n 
nodes in a network such that given any k {< n) of these nodes 
one can reconstruct the original file. We also want to have such 
a redundancy in our network that if we lose a node then any 
d{< n) of the remaining nodes can repair the lost node. We 
assume that each node stores the amount a of information, e.g., 
a symbols over a finite field, and in the repair process each 
repairing node transmits the amount /3 to the new replacing 
node (called a newcomer) and hence the total repair bandwidth 
is 7 = d/3. We also assume that k < d. 

The repair process can be either functional or exact. By 
functional repair we mean that the nodes may change over 
time, i.e., if a node vf'^ is lost and in the repair process we get 
a new node v"™ instead, then we may have vf'^ ^ w"™. If only 
functional repair is assumed then the capacity of the system, 
denoted by Ck,d{<^, 7), is known. Namely, it was proved in the 
pioneering work by Dimakis et al. 1 1 1 that 
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If the size of the stored file is fixed to be B then the above 
expression for the capacity defines a tradeoff between the node 
size a and the total repair bandwidth 7. The two extreme points 
are called the minimum storage regeneration (MSR) point and 
the minimum bandwidth regeneration (MBR) point. The MSR 
point is achieved by first minimizing a and then minimizing 
7 to obtain 

clB (I) 



By first minimizing 7 and then minimizing a leads to the MBR 
point 
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In this paper we are interested in codes that have exact 
repair The concepts of exact regeneration and exact repair 
were introduced independently in ^, [Jl, and lIU. Exact repair 
means that the network of nodes does not vary over time, i.e., 
if a node vf'^ is lost and in the repair process we get a new 
node wf*, then vf'^ = wf". We denote by 

the capacity of codes with exact repair with n nodes each of 
size a, with total repair bandwidth 7, and for which each set 
of k nodes can recover the stored file and each set of d nodes 
can repair a lost node. 

We have by definition that 

QXdK7)<CM(«,7)- 

It was proved in ||5l, Q, and lH] that the codes with exact 
repair achieve the MSR point and in |5| that the codes with 
exact repair achieve the MBR point. The impossibility of 
constructing codes with exact repair at essentially all interior 
points on the storage-bandwidth tradeoff curve was shown in 



B. Contributions and Organization 

In Section III] we give a construction for codes between 
MSR and MBR points with exact repair. In Section [III] we 
derive some inequalities from our construction. Section IV 
provides an example showing that, in the special case of 
n — k + \ — d+1, our construction is close to optimal when 
comparing to the known capacity when only functional repair 
is required. In Section W\ we show that when the distances 
of the parameters n, k, and d are fixed but the actual values 
approach to infinity, the fraction of performance of our codes 
with exact repair and the known capacity of functional-repair 
codes approaches to one. 



II. Construction 

Assume we have a storage system DSSi with exact repair 
for parameters 

{n, k,d) 

with a node size a and the total repair bandwidth j — df3. In 
this section we propose a construction that gives a new storage 
system for parameters 

{n ^n + l,k' = k + l,d' = d+ 1). 

Let DSSi consist of nodes vi, . . . , u„, and let the stored file 
F be of maximal size ^"^'^(0,7). 

Let then DSSi^ denote a new system consisting of the 
original storage system DSSi and one extra node w„+i storing 
nothing. It is clear that DSSi+ is a storage system for 
parameters 

(n + l.fc + l.d+l) 



DSS'^™ the j'th node stores nothing while the other nodes are 
as those in the original system DSSi. 

Using these four new systems as building blocks we 
construct a new system DSS2 such that its jth node for 
j — 1, ... ,4 stores the jth node from each system ZJS'S'"'^* for 
i = 1, . . . , 4. Hence each node in DSS2 stores (4— l)a = 3a 
and the total repair bandwidth is (4 — 1)7 — 87. 

For example, if the original system DSSi consists of nodes 
vi storing x, V2 storing y, and ^3 storing x + y then DSS"™ 
consists of nodes uu storing nothing, U12 storing xi, W13 
storing yi, and U14 storing xi+yi. Similarly DSS2™ consists 
of nodes U21 storing X2, U22 storing nothing, U23 storing y2, 
and W24 storing 2:2+1/2 and so on. Then in the resulting system 
the first node wi consists of nodes un (storing nothing), U21 
(storing X2), M31 (storing X3), and U41 (storing X4). The stored 
file is {xi,x2,X3,X4,yi,y2,y3,y4)- 



and can store the original file F. 

Let <Tj be the permutations of the set {1, . . . , n+1} for j = 
1, . . . , (n + 1)! . Assume that DSS"™ is a storage system for 
j = 1, . . . , (n + 1)! corresponding to the permutation aj such 
that DSS'^™ is exacdy the same as DSSi+ except that the 
order of the nodes is changed corresponding to the permutation 
a J, i.e., the ith node in DSSi+ is the CT-,(i)th node in DSS'^™. 

Using these {n + 1)! new systems as building blocks we 
construct a new system DSS2 such that its jth node for j = 
1, . . . , ri + 1 stores the jth node from each system DSSf^"" for 
z = l,...,(n + l)!. 

It is clear that this new system DSS2 works for parameters 
{n + l,k + l,d+l), has exact repair property, stores a file of 
size (n + 1)!C'™'^'c;(q^:7) and has a node size 

c*2 = (("■ + 1)! ^ n\)a = n ■ n\a 
and total repair bandwidth 

72 = ((''* + 1)! ^ n\)^ = n ■ n\^ . 

Moreover, because of the symmetry of the construction we 
have [32 — n ■ n\f3 . 



This construction implies the inequality 
that is. 



C:rlk+i,d+iin ■ n\a, n ■ n\j) > {n + l)!QX,(a, 7), 



C„+i,fe+i,d+i(a,7) ^ 
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QT,(«,7). 



(3) 



Example 2.1: If we relax on the typical requirement of a 
DSS to be homogeneous, meaning that each node is transmit- 
ting the same amount /3 of information in the repair process, 
and instead only require that the total repair bandwidth 7 is 
constant {i.e., (3 may take different values depending on the 
node), then we can build our construction a little easier Let 
{n, k, d) — (3, 2, 2) and DSSi be a distributed storage system 
with exact repair. Let DSS"™ be a storage system with 4 
nodes for j = 1, . . . , 4 where the jth node stores nothing, the 
ith node for i < j stores as the ith node in the original system 
DSSi, and the ith node for i > j stores as the {i — l)th node 
in the original system DSSi. That is, in the jth subsystem 
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Fig. 1. The figure illustrates the DSS built in Example |2. 1 1 It consists of 
nodes w\, 102, U13, and U14. 



III. Inequalities from the Construction 

Next we will derive some inequalities for the capacity in 
the case of exact repair. Using Equation [3] inductively we get 

Theorem 3.1: For an integer j € [0, fc — 1] we have 



Q':M(a,7)> 
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y^exact 

ri-j,k-j,d-j 



(a,7)- 



It is proved in 0, El, and \&\ that the MSR point can be 
achieved if exact repair is assumed. As a consequence of this 
and Theorem [3] we get the following bound. 

Theorem 3.2: For integers 1 < i < fc we have 

(d — fc + i)ot\ ^ nia 



^exact 



d-k + l 



> 



— k + i 



Proof: Write n' = n — j,k' = k — j,d' = d — j, a — p, 



d'B 



^^^ ^ "" k'id'-t'+i) - ^^ i^ proved in 0, Q, and |I8| that 

CTk'.d'ic^,l)^B, 



inexact 
ri-j,k-j,d-j 



"'i^fcTi) = (^~^'^"- 



Hence by Theorem |3.1| we have 



M 



C exact , 
nM,d I "i 



{d - i)a \ ^ n{k-j)a 



d-k + l 



n-j 



Now a change of variables by setting i — k — j gives us 
the result. ■ 



IV. Example: Case n = k + l = d + l 

In this section we study the special case n — k + 1 = d+1 
and compare it to the known capacity with the assumption of 
functional repair, 



n-2 



C„_i,„_i(a,7) = ^ 

j=0 



n — 1 — j 

mm <, a, ; — 7 

n — 1 



Now our bound gives 

77 



SO we can write 



fn{^) 



1 + i 



for integers i = 1, . . . ,k. 



Notice that now in the extreme points our lower bound 
achieves the known capacity, i.e.. 



C,l';^-l.n-l («,«) = /„(!) 



na 



for the MBR point and 

C™ii,„_i(a, ka) = Uk) = (n - l)a 
for the MSR point. 

As an example we study the fraction 



ma 
l + i 



for integers i E [l,k]. Writing it out we see that 



Cn-i,n-iia,ia) 



i+t 



E=oi + E 



n—2 71 — 1— j 
j=T+l n-1 ' 
ni 

1+i 



(4) 



where T ^ [(n - 1)(1 - \)\ 

For large values of n this is approximately 



2f 



> 



2i^ + i-l - 9 



for all i = 1, . . . , fc. 
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Fig. 2. The figure shows the performance M of our construction (dotted 
curve) between the capacity of functionally repairing codes (uppermost curve) 
and the trivial lower bound given by inteipolation of the known MSR and MBR 
points when (n, k, d) = (51, 50, 50), a = 1, and 7 S [1, 50]. 



V. The case when n, k and d are close to each 

OTHER 

Next we will study the special case where n, k and d are 
close to each other We will do this by setting um = n + M, 
ki\i = k + M and di\i = d + M and letting M — > 00, and then 
examine how the capacity curve asymptotically behaves. The 
example in the previous section showed us that in that special 
case our bound is quite close to the capacity of functionally 
regenerating codes. However, in the previous section we fixed 
i to be an integer and then assumed that n is large. In this 
section we tie up the values i and Af together to arrive at 
a situation where the total repair bandwidth stays on a fixed 
point between its minimal possible value given by the MBR 
point and its maximal possible value given by the MSR point. 



For each M the bound from Theorem 3.2 gives 



c: 



nM :^M-dM 



[dM - kM + i)ct\ ^ riMia 
a, — z z I > 



dhi — ^Af + l J ~ n- k 
for i = 1, . . . , kM, hence in this section we write 

UMia 



9M{i) 



n — k + i 



for integers i = 1 , . . . . fc and extend this definition for x £ 
[1, fc] such that gM{x) is the piecewise linear curve defined by 

Let s e (0, 1] be a fixed number and i = 1 + s{kM — !)• 
We will study how the fraction 

5m(«) 



1^ f {dM — kM+i)a \ 

behaves as we let M ^ 00. Informally this tells how close our 
lower bound curve and the known capacity curve are to each 
other when M is large, i.e., values nM,k]\i,dM are close to 
each other 

Remark 5.1: In the MSR point we have 

dMC( 



IMSR — 



l-M 



tM ■ 



1 



and in the MBR point 



JMBR = a- 



Hence 



d-M - kM + 1 , ,, ^ 

a ■ ; — - = S-fMSR + (1 - SHMBR- 

d-M -Km + 1 



Theorem 5.1: Let s G (0, 1] be a fixed number and i = 

1 + s{kM — 1). Then 

,• 5m(«) , 

lin^ t:^ i:r-z^^ — = 1- 

M^oo r< J (n (d-M-kM+i)a \ 
^kMAuK'-^J dM-fcM + 1 -' 

Proof: Let i = 1 + s{kM — !)■ We study the behavior of 
the fraction for large Af , so we have ^ k. \. Thus, to simplify 
the notation, we may assume that i acts as an integer. We also 
use the notation 



i = 



dMs{kM - 1) 



We have 

5m(1 + s{kM - 1)) 



d - fc + 1 + s{kM - 1) ■ 

nAf(l + s{kM - l))a 



n — k + i 



and 



Cfc„,dM(": 



(dM - kM + ^)"^ 



dM - kM + 1 

t kM — 'i- 

^j=0 j=t+l 

(kM - t - l)(2d + M -k- t){d - k + i) 
2dM{d-k + l) 



dM — j d — k + i 
dM d — k + 1 



=a\ t+1 



whence 



9M{i) 



f-i ( {dM — kM+i)a \ 

'-'kM^duK^i d„j-kM+l > 

hijM) 

'h2{M){h3{M) + h4{M))' 



(5) 



(6) 



where 



hiiM) = 2nM(l + s{kM - l))dMid -k + 1), 
h2{M) =n-k + l + s{kM - 1), 
hsiM) = 2{t + l)dM{d - k + 1), 



and 



hi{M) = {kM~t-l){2d-k + M-t){d-k + l + s{kM-l))- 

Now it is easy to check that 

hi{M) 



M3 



-> 2s{d-k + l), 



h2{M) 



M 



^s, 



and 



M2 



2{d-k + l) 



as M — >■ cx). 
Note that 



M-t 



d — fc + 1 — ds 



when M is large and hence 

hi{M) 

IVP 
{kM -t-l){2d-k + M -t) d-k + l + s{kM - 1) 



M 



M 



as A/ ^- oo. 
Finally, 



(7) 



5A/(J) 



^ / (dM—kM+i)a ' 



/tl(M) 



/t2(M) h3(M)+;i4(M) 

2s(d-fe + l) 



(8) 



s{2{d - k + 1) + 0) 
as Af — > oo, proving the claim. 

■ 
As a straightforward corollary to Theorem |5.2| we have 

Theorem 5.2: Let s E [0, 1] be a fixed number and let 

IMSR = d,fX,+i ^'^ ^^^BR = a. Then 

^.^ C«''M.fcM.dM(°'' ^^MSR + (1 - g)7A/Bfl) ^ ^ 
M^oo Ck,„,dM ("> SJMSR + (1 - shMBR) 

VI. Conclusions 

We have shown in this paper that when n, fc, and d are 
close to each other, the capacity of a distributed storage system 
when exact repair is assumed is essentially the same as when 
only functional repair is required. This was proved by using 
a specific code construction exploiting some already known 
codes achieving the MSR point on the tradeoff curve and by 
studying the asymptotic behavior of the capacity curve. 

However, when n, k, and d are not close to each other then 
the bound our construction gives is not good. So as a future 
work it is still left to find the precise expression of the capacity 
of a distributed storage system when exact repair is assumed, 
and especially to study the behavior of the capacity when n, 
k, and d are not close to each other 
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